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Abstract. We prove that every odd number N greater than 1 can be expressed as 
the sum of at most five primes, improving the result of Ramare that every even natural 
£NJ , number can be expressed as the sum of at most six primes. We follow the circle method 

of Hardy-Littlewood and Vinogradov, together with Vaughan's identity; our additional 
techniques, which may be of interest for other Goldbach-type problems, include the use 
of smoothed exponential sums and optimisation of the Vaughan identity parameters to 
save or reduce some logarithmic losses, the use of multiple scales following some ideas 
of Bourgain, and the use of Montgomery's uncertainty principle and the large sieve 
\ to improve the L 2 estimates on major arcs. Our argument relies on some previous 

numerical work, namely the verification of Richstein of the even Goldbach conjecture 
up to 4 x 10 14 , and the verification of van de Lune and (independently) of Wedeniwski 
■ of the Riemann hypothesis up to height 3.29 x 10 9 . 

■i— > 

a 

1. Introduction 



Two of most well-known conjectures in additive number theory are the even and odd 
Goldbach conjectures, which we formulate as follow^ 



> 
in 

■ Conjecture 1.1 (Even Goldbach conjecture). Every even natural number x can be 

expressed as the sum of at most two primes. 

^ \ Conjecture 1.2 (Odd Goldbach conjecture). Every odd number x larger than 1 can be 

expressed as the sum of at most three primes. 

It was famously established by Vinogradov [37], using the Hardy-Littlewood circle 
method, that the odd Goldbach conjecture holds for all sufficiently large odd x. Vino- 
gradov's argument can be made effective, and various explicit thresholds for "sufficiently 
large" have been given in the literature; in particular, Chen and Wang [5] established the 
odd Goldbach conjecture for all x ^ exp(exp(l 1.503)) ~ exp(99012), and Liu & Wang 
|24j subsequently extended this result to the range x ^ exp(3100). At the other extreme, 
by combining Richstein's numerical verification [31] of the even Goldbach conjecture for 
x ^ 4 x 10 14 with effective short intervals containing primes (based on a numerical 
verification of the Riemann hypothesis by van de Lune and Wedeniwski [50]), Ramare 
and Saouter [30] verified the odd Goldbach conjecture for n ^ 1.13 x 10 22 ~ exp(28). 
By using subsequent numerical verifications of both the even Goldbach conjecture and 
the Riemann hypothesis, it is possible to increase this lower threshold somewhat, but 



1 The odd Goldbach conjecture is also often formulated in an almost equivalent (and slightly stronger) 
fashion as the assertion that every odd number greater than seven is the sum of three odd primes. 
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there is still a very significant gap between the lower and upper thresholds for which 
the odd Goldbach conjecture is knownB 

To explain the reason for this, let us first quickly recall how Vinogradov-type theorems 
are proven. To represent a number x as the sum of three primes, it suffices to obtain a 
sufficiently non-trivial lower bound for the sum 

A(ni)A(n 2 )A(n 3 ) 

ni ! n2,n 3 :n 1 +n2+n 3 =x 

where A is the von Mangoldt function (see Section[2]for definitions). By Fourier analysis, 
we may rewrite this expression as the integral 

/ S(x, a) 3 e(— xot) da (1.1) 
</ r/z 

where e(x) := e 2mx and S(x,a) is the exponential sum 

S(x, a) := A{n)e{na). 

n^x 

The objective is then to obtain sufficiently precise estimates on S(x, a) for large x. Using 
the Dirichlet approximation theorem, one can approximate a = - + for some natural 
number 1 ^ q ^ Q, some integer a with (a, q) — 1, and some with \(3\ sC where Q 
is a threshold (somewhat close to N) to be chosen later. Roughly speaking, one then 
divides into the major arc case when q is small, and the minor arc case when q is large 
(in practice, one may also subdivide these cases into further cases depending on the 
precise sizes of q and (3). In the major arc case, the sum S(x, a) can be approximated 
by sums such as 

S(x,a/q) = A(n)e(an/q), 

which can be controlled by a suitable version of the prime number theorem in arithmetic 
progressions, such as the Siegel-Walfisz theorem. In the minor arc case, one can instead 
follow the methods of Vinogradov, and use truncated divisor sum identities (such as 
(variants of) Vaughan's identity, see Lemma 14 . 1 1 1 below) to rewrite S(x, a) into various 
"type I" sums such as 

^^/i(<f) log ne(adn) 

d^U ni^x/d 

and "type II" sums such as 

E5>^)( E MbMadw) (1.2) 

d>Uw>V b\w:b>V 

which one then estimates by standard linear and bilinear exponential sum tools, such 
as the Cauchy-Schwarz inequality. 



We remark however that the odd Goldbach conjecture is known to be true assuming the generalised 
Riemann hypothesis; see [5]. 
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A typical estimate in the minor arc case takes the form 

\S(x,a) \ < I ^ + — ^= + x 4/5 ) log 3 x; (1.3) 




see e.g. [191 Theorem 13.6]. An effective version of this estimate (with slightly different 
logarithmic powers) was given by Chen & Wang [Bj; see (11.131) below. Note that one 

cannot hope to obtain a bound for S(x, a) that is better than — without making 
progress on the Siegel zero problem (in order to decorrelate A from the quadratic char- 
acter with conductor q); see [35] for further discussion. In a similar spirit, one should 
not expect to obtain a bound better than — f= unless one exploits a non-trivial error 

term in the prime number theorem for arithmetic progressions (or equivalently, if one 
shows that L-functions do not have a zero on the line 3?s = 1), as one needs to decorre- 
latd A from Archimedean characters n lt with t comparable to x/q, (or from combined 
characters v(n)n**). 

If one combines (11.31) with the L 2 bound 

/ \S(x,a)\ 2 da < a; log a; (1.4) 
Jr/z 

arising from the Plancherel identity and the prime number theorem, one can obtain 
satisfactory estimates for all minor arcs with q ^> log 8 x (assuming that Q was chosen 
to be significantly less than x/log 8 x). To finish the proof of Vinogradov's theorem, 
one thus needs to obtain good prime number theorems in arithmetic progressions whose 
modulus q can be as large as log 8 x. While explicitly effective versions of such theorems 
exist (see e.g. [27], [23], [39], [11], [20]), their error term decays very slowly (and is 
sensitive to the possibility of a Siegel zero for one exceptional modulus g); in particular, 
errors of the form 0(x exp(— c^Jlogx)) for some explicit but moderately small constant 
c > are typical. Such errors can eventually overcome logarithmic losses such as logx, 
but only for extremely large x, e.g. for x much larger than 10 100 , which can be viewed 
as the principal reason why the thresholds for Vinogradov-type theorems are so large. 

It is thus of interest to reduce the logarithmic losses in (11. 3p . particularly for moderately 
sized x such as x ~ 10 30 , as this would reduce the rangqj of moduli q that would have 
to be checked in the major arc case, allowing for stronger prime number theorems in 
arithmetic progressions to come into play. In particular, the numerical work of Piatt 



3 Indeed, suppose for sake of heuristic argument that the Riemann zeta function had a zero very 
close to 1 + ix/q, then the von Mangoldt explicit formula J2 n <x ^( n ) wou ld contain a term close to 
— Sn<a; n~ lx l q , which suggests upon summation by parts that S(x : a) would contain a term close to 
_ En<i Ii ~' I ''' e ( a;E )i which can be of size comparable to x/y/x/q when a is close to (say) l/q. A 
similar argument involving Dirichlct i-functions also applies when a is close to other multiples of l/q. 
This suggests that one would need to use zero-free regions for zeta functions and L-functions in orderto 

improve upon the , x bound. 

V x /i 

4 It seems however that one cannot eliminate the major arcs entirely. In particular, one needs to 
prevent the majority of the primes from concentrating in the quadratic non-residues modulo q for q = 3, 
q = 4, or q = 5, as this would cause the residue class mod q to have almost no representations as 
the sum of three primes. 
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has established zero-free regions for Dirichlet L-functions of conductor up to 10 5 
(building upon earlier work of Rumely [39] who obtained similar results up to conductor 
10 2 ), and it is likely that such results would be useful in future work on Goldbach-type 
problems. 

Several improvements or variants to fll.3p are already known in the literature. For 
instance, prior to the introduction of Vaughan's identity in [4"B"] : Vinogradov [48J had 
already established a bound of the shape 



\S(x,a)\ < ( ^— + — = + exp(-± v / logx) ) log 11/2 



x 



by sieving out small primes. This was improved by Chen [I] and Daboussi [7] to obtain 



\S(x,a)\ <C ( -^z H 1= + exp(-|- v /logx) ) log 3/4 x(log log x) 



1/2 



and the constants made explicit (with some slight degradation in the logarithmic ex- 
ponents) by Daboussi and Rivat [8]. Unfortunately, due to the slow decay of the 
exp(— ^logx) term, this estimate only becomes non-trivial for x ^ 10 184 (see [8] §8]), 
and is thus not of direct use for smaller values of x. 



In [2], Buttkewitz obtained the bound 

\S(x,a) \ <a — log 1/4 x H —log log log x + - — j— (1.5) 

y/q x/q \og A x 

for any A ^ 1 (assuming (5 extremely small), using Lavrik's estimate [22] on the average 
error term in twin prime type problems, but the constants here are ineffective (as 
Lavrik's estimate uses the Siegel-Walfisz theorem). Finally, in the "weakly minor arc" 
regime log q ^ log 1 / 3 x, Ramare [37] used the Bombieri sieve to obtain an effective 
bound of the form 

\S(x,a)\^^ (1.6) 

which, as mentioned previously, is an essentially sharp effective bound in the absence 
of any progress on the Siegel zero problem. Unfortunately, the constants in [37J are not 
computed explicitly, and seem to be far too large to be useful for values of x such as 
10 30 . 



In this paper we will not work directly with the sums S(x,a), but instead with the 
smoothed out sums 

S mo (x,a) := ^ A(ri)e(an) l^ !qo ) =1 r](n/x) 

n 

for some (piecewise) smooth functions r\ : R — > C and a modulus q . The modulus q is 
only of minor technical importance, and should be ignored at a first reading (see Lemma 
14. II for a precise version of this statement). For sake of explicit constants we will often 
work with a specific choice of cutoff rj, namely the cutoff 

77o(t):=4(log2-|log2t|) + , (1.7) 
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which is a Lipschitz continuous cutoff of unit mass supported on the interval [1/4, 1]. 
The use of smooth cutoffs to improve the behaviour of exponential sums is of course 
well established in the literature (indeed, the original work of Hardy and Littlewood [16J 
on Goldbach-type problems already used smoothed sums). The reason for this specific 
cutoff is that one has the identity 



CO 



dW 



Vo(dw/x) = Aj^ jW] (w) — (1.8) 

for any d,w,x > 0, which will be convenient for factorising the Type II sums into a 
tractable form. In some cases we will take go to equal 2, in order to restrict all sums to 
be over odd numbers, rather than all natural numbers; this has the effect of saving a 
factor of two in the explicit constants. Because of this, though, it becomes natural to 
approximate 4a by a rational number a/q, rather than a itself. 

Our main exponential sum result can be stated as follows. 

Theorem 1.3 (Exponential sum estimate). Let x ^ 10 20 be a real number, and let 
4a = - + for some natural number 100 ^ q ^ x/100 with (a,q) = 1 and some 
G [— 1/g 2 , 1/g 2 ]. Let qo be a natural number, such that all prime factors of qo do not 
exceed y/x. Then 

\S Voqo {x,a)\ ^ (0.14— + 0.64^L= + 0.15x 4/5 ) log x{\ogx + 11.3) (1.9) 
\ VQ \jxjq J 

If q ^ x 1 / 3 , one has the refinement 

\S Vo>qo (x,a)\ ^ 0.5-(log2x)(log2x + 15) + 0.31^ log g(logg + 8.9) (1.10) 

and similarly when q ^ x 2 ^ 3 , one has the refinement 

IS^MI <3.12-^-(bg2ar)0ogg + 8) + 1.19-J=log-(log- + 2.3). (1.11) 
X /Q \/x/q q 1 

If q ^ £ 2//3 and a = ±1, one has the further refinement 

\S vom (x, a)\ ^ 9.73—^ log 2 x + 1.2-5= log -(log - + 2.4). (1.12) 



The first estimate (II. 9p is basically a smoothed out version of the standard estimate 
(jl.3|) . with the Lipschitz nature of the cutoff r]o being responsible for the saving of one 
logarithmic factor, and will be proven by basically the same method (i.e. Vaughan's 
identity, followed by linear and bilinear sum estimates of Vinogradov type); it can be 
compared for instance with the explicit estimate 

| Si [01] i(x,4a)| ^ 0.177— log 3 x + 0.08— = log 3 ' 5 x + 3.8x 4/5 log 2 ' 2 x (1.13) 

sjxjq 

of Chen & Wang [6], rewritten in our notation. In practice, for x of size 10 30 or so, the 
estimate ( II. 3p improves upon the Chen- Wang estimate by about one to two orders of 
magnitude, though this is admittedly for a smoothed version of the sum considered in 



6 



TERENCE TAO 



The estimate (11.91) is non-trivial in the regime log 4 x <C q <C x/\og A x. The key point, 
though, is that one can obtain further improvement over (11 .9p in the most important 
regimes when q is close to 1 or to x, by reducing the argument of the logarithm (or 
by squaring the denominator); indeed, (II. 9p when combined with (jl.lOp . ( II. lip is now 
non-trivial in the larger range log 2 x <C q <C x/log 2 x, and with (I1.12p one can extend 
the upper threshold of this range from xj log 2 x to xj logx, at least when a = ±1. 

The bounds in Theorem [L3] are basically achieved by optimising in the cutoff parameters 
U,V in Vaughan's identity, and (in the case of ( I1.12p ) a second integration by parts, 
exploiting the fact that the derivative of r] has bounded total variation. Asymptotically, 
the bounds here are inferior to those in (11.51) . (jl.6p . but unlike those estimates, the 
constants here are reasonable enough that the bounds are useful for medium-sized values 
of x, for instance for x between 10 30 and 10 1300 . There is scope for further improvement!! 
in the exponential sum bounds in these ranges by eliminating or reducing more of the 
logarithmic losses (and we did not fully attempt to optimise all the explicit constants), 
but the bounds indicated above will be sufficient for our applications. 

We will actually prove a slightly sharper (but more technical) version of Theorem 11.31 
with two additional parameters U and V that one is free to optimise over; see Theorem 
15.11 below. 

As an application of these exponential sum bounds, we establish the following result: 

Theorem 1.4. Every odd number x larger than 1 can be expressed as the sum of at 
most five primes. 

This improves slightly upon a previous result of Ramare [35], who showed (by a quite 
different method) that every even natural number can be expressed as the sum of at 
most six primes. In particular, as a corollary of this result we may lower the upper 
bound on Shnirelman's constant (the least number k such that all natural numbers 
greater than 1 can be expressed as the sum of at most k primes) from seven to six 
(note that the even Goldbach conjecture would imply that this constant is in fact three, 
and is in fact almost equivalent to this claim). We remark that Theorem 11.41 was also 
established under the assumption of the Riemann hypothesis by Kaniecki [21] (indeed, 
by combining his argument with the numerical work in [UJ , one can obtain the stronger 
claim that any even natural number is the sum of at most four primes). 

Our proof of Theorem 1 1 . 41 also establishes that every integer x larger than 8.7 x 10 36 can 
be expressed as the sum of three primes and a natural number between 2 and 4 x 10 14 ; 
see Theorem 18.21 below. 



Indeed, since the release of an initial preprint of this paper, such improvements to the above bounds 
have been achieved in |18j . The author expects however that in the case of most critical interest, namely 
when q is very close to 1 or to x, the Vaughan identity-based methods in the above theorem are not 
the most efficient way to proceed. Thus, one can imagine in future applications that Theorem 11.31 (or 
variants thereof) could be used to eliminate from consideration all values of q except those close to 
1 and x, and then other methods (e.g. those based on the zeroes of Dirichlet L-functions, or more 
efficient identities than the Vaughan identity) could be used to handle those cases. 
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To prove Theorem 11.44 we will a ls° need to rely on two numerically verified results in 
addition to Theorem 11.31 

Theorem 1.5 (Numerical verification of Riemann hypothesis). Let T := 3.29 x 10 9 . 

Then all the zeroes of the Riemann zeta function ( in the strip {s : < < 1; ^ 
$s (s) ^ T } lie on the line dt(s) = 1/2. Furthermore, there are at most 10 10 zeroes in 
this strip. 



Proof. This was achieved independently by van de Lune (unpublished), by Wedeniwski 
[50J , by Gourdon [TJ] , and by Piatt [31] . Indeed, the results of Wedeniwski allow one 
to take T as large as 5.72 x 10 10 , and the results of Gourdon allow one to take T 
as large as 2.44 x 10 12 ; using interval arithmetic, Piatt also obtained this result with 
To as large as 3.06 x 10 10 . (Of course, in these latter results there will be more than 
10 10 zeroes.) However, we will use the more conservative value of To = 3.29 x 10 9 in 
this paper as it suffices for our purposes, and has been verified by four independent 
numerical computations. □ 

Theorem 1.6 (Numerical verification of even Goldbach conjecture). Let Nq :— 4x 10 14 . 

Then every even number between 4 and Nq is the sum of two primes. 



Proof. This is the main result of Richstein [4"T] . A subsequent (unpublished) verification 
of this conjecture by the distributed computing project of Oliveira e Silva [32] allows one 
to take iVo as large as 2.6 x 10 18 (with the value No = 10 17 being double-checked), but 
again we shall use the more conservative value of Nq = 4 x 10 14 in this paper as it suffices 
for our purposes, and has been verified by three independent numerical computations. 
□ 

The proof of Theorem 11.41 is given in Section [H] proceeds according to the circle method 
with smoothed sums as discussed earlier, but with some additional technical refinements 
which we now discuss. The first refinement is to take advantage of Theorem 11.61 to 
reduce the five-prime problem to the problem of representing a number x as the sum of 
three primes n\,n2,n^ and a number between 2 and Nq. As far as the circle method is 
concerned, this effectively restricts the frequency variable a to the arc {a : ||«|| R / Z <C 
1/iVo}. At the other end of the spectrum, by using Theorem 11.51 and the von Mangoldt 
explicit formula one can control quite precisely the contribution of the major arc {a : 
IMIr/z "C T q /x}; see Proposition 17.21 Thus leaves only the "minor arc" 

| « IMU/s « ± (i.i4) 

that remains to be controlled. 

By using Theorem 11.61 and Theorem 11.51 (or more precisely, an effective prime number 
theorem in short intervals derived from Theorem 11.51 due to Ramare and Saouter [40]). 
we will be able to assume that x is moderately large (and specifically, that x ^ 8.7 x 
10 36 ). By using existing Vinogradov-type theorems, we may also place a large upper 
bound on x; we will use the strongest bound in the literature, namely the bound x ^ 
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exp(3100) o£| Liu & Wang [21]. In particular, log a; is relatively small compared to the 
quantities N and T , allowing one to absorb a limited number of powers of logx in the 
estimates. 

It remains to obtain good L 2 and L°° estimates on the minor arc region (11.141) . We will 
of course use Theorem 11.31 for the L°° estimates. A direct application of the Plancherel 
identity would cost a factor of log x in the L 2 estimates, which turns out to be unac- 
ceptable. One can use the uncertainty principle of Montgomery [28] to cut this loss 
down to approximately 2 ^°^ (see Corollary 14. 7j) . but it turns out to be more efficient 
still to use a large sieve estimate of Siebert [15] on the number of prime pairs (p, p + h) 
less than x for various h to obtain an L 2 estimate which only loses a factor of 8 (see 
Proposition 14. lOj) . 

In order to nearly eliminate some additional "Archimedean" losses arising from convolv- 
ing together various cutoff functions rj on the real line R, we will use a trick of Bourgain 
[3J, and restrict one of the three summands ni, n 2 , n 3 to a significantly smaller order of 
magnitude (of magnitude x/K instead of x for some K, which we will set to be 10 3 , in 
order to be safely bounded away from both 1 and T ). By estimating the exponential 
sums associated to ni,n,2 in L 2 and the sum associated to in L°°, one can avoid 
almost all Archimedean losses. 

As it turns out, the combination of all of these tricks, when combined with the expo- 
nential sum estimate and the numerical values of N Q and T , are sufficient to establish 
Theorem 11.41 However, there is one final trick which could be used to reduce certain 
error terms further, namely to let K vary over a range (e.g. from 10 3 to 10 6 ) instead of 
being fixed, and average over this parameter. This turns out to lead to an additional 
saving (of approximately an order of magnitude) in the weakly minor arc case when a 
is slightly larger than Tq/x. While we do not actually utilise this trick here as it is not 
needed, it may be useful in other contexts, and in particular in improving the known 
upper threshold for Vinogradov's theorem. 
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The numerology in our argument is such that one could also use the weaker bound x ^ 
exp(exp(11.503)) provided by Chen & Wang [5], provided one assumed the even Goldbach conjec- 
ture to be verified up to N = 10 17 , a fact which has been double-checked in [32]. Alternatively, if one 
uses the very recent minor arc bounds in |18) that appeared after the publication of this paper, then 
no Vinogradov type theorem is needed at all to prove our result, as the bounds in |18) do not contain 
any factors of logx that need to be bounded. 
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2. Notation 



All summations in this paper will be over the natural numbers N = {1, 2, 3, . . .} unless 
otherwise indicated, with the exceptions of sums over the index p, which are always 
understood to be over primes. 

We use (a, b) to denote the greatest common divisor of two natural numbers, and [a, b] 
for the least common multiple. We write a\b to denote the assertion that a divides b. 

Given a statement E, we use 1^ to denote its indicator, thus 1e equals 1 when E is 
true and otherwise. Thus, for instance l( n ,g)=i is equal to 1 when n is coprime to q, 
and equal to otherwise. 

We will use the following standard arithmetic functions. If n is a natural number, then 

• A(n) is the von Mangoldt function of n, defined to equal \ogp when n is a power 
of a prime p, and zero otherwise. 

• /x(n) is the Mobius function of n, defined to equal (— l) k when n is the product 
of k distinct primes, and zero otherwise. 

• 0(n) is the Euler totient function of n, defined to equal the number of residue 
classes of n that are coprime to n. 

• uj(n) is the number of distinct prime factors of n. 

Given two arithmetic functions /, g : N — > C, we define the Dirichlet convolution f * g : 
N — > C by the formula 

f*9(n) 

d\n 

thus for instance \i * l{n) = l„=o, A * l(n) = log(n), and ji * log(n) = A(n). 

Given a positive real Q, we also define the primorial Q% of Q to be the product of all 
the primes up to Q: 

Thus, for instance l( n ,Q$)=i equals 1 precisely when n has no prime factors less than or 
equal to Q. 

We use the usual LP norms 

II/IIlp(r) == (/ l/OOP ^ 

for 1 ^ p < oo, with ||/||l°o(r) denoting the essential supremum of /. Similarly for other 
domains than R which have an obvious Lebesgue measure; we define the £ p norms for 
discrete domains (such as the integers Z ) in the usual manner. 

We use R/z to denote the unit circle. By abuse of notation, any 1-periodic function on 
R is also interpreted as a function on R/z, thus for instance we can define | sin(7ra:)| for 
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any a G R/z. We let ||ot|| R / z be the distance to the nearest integer for a G R, and this 
also descends to R/z. We record the elementary inequalities 

2 1| o; j| R / z ^ |sin(7ra)| ^ vr||a|| R / z ^ |tan(7ra)|, (2.1) 

valid for any a G R/z. In a similar vein, if a G Z/qZ, we consider - as an element of 

r/z. 

When considering a quotient y with a non-negative denominator K, we adopt the 
convention that tf = +oo when Y is zero. Thus for instance -n— it — is equal to +oo 
when a is an integer. 

We will occasionally use the usual asymptotic notation X = 0(Y) or I C 7 to 
denote the estimate \X\ ^ CY for some unspecified constant C . However, as we will 
need explicit bounds, we will more frequently (following Ramare [35J) use the exact 
asymptotic notation X = 0*(Y) to denote the estimate \X\ ^ Y. Thus, for instance, 
X = Y + 0*(Z) is synonymous with \X - Y | < Z. 

If F : R — > C is a smooth function, we use F', F" to denote the first and second 
derivatives of F, and F^ to denote the fc-fold derivative for any k ^ 0. 

If a: is a real number, we denote e(x) := e 2mx , we denote x + := max(x, 0), and we 
denote [^J to be the greatest integer less than or equal to x. We interpret the x x + 
operation to have precedence over exponentiation, thus for instance x\ denotes the 
quantity max(x, 0) 2 rather than max(x 2 , 0). 

If E is a finite set, we use \E\ to denote its cardinality. If / is an interval, we use |/| to 
denote its length. We will also occasionally use translations I + x ■— {y + x : y G /} 
and dilations XI := {Xy : y G 1} of such an interval. 

The natural logarithm function log takes precedence over addition and subtraction, but 
not multiplication or division; thus for instance 

log 2x + 15 = (log(2ac)) + 15. 



3. Exponential sum estimates 



We now record some estimates on linear exponential sums such as 

F(n)e(an) 

nez 

for various smooth functions F : R — > C, as well as bilinear exponential sums such as 

}] 22 a nb m e(anm) 

nez mSZ 

for various sequences (a n ) nez and (b m ) mez . These bounds are standard (at least if one 
is only interested in bounds up to multiplicative constants), but we will need to make 
all constants explicit. Also, in order to save a factor of two or so in the explicit bounds, 
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we will frequently restrict the summation to odd integers by inserting weights such as 
l(n,2)=i, and will therefore need to develop variants of the standard bounds for this 
purpose. 

We begin with some standard bounds on linear exponential sums with smooth cutoffs. 

Lemma 3.1. Let a G R/Z, and let F : R — > C be a smooth, compactly supported 
function. Then we have the bounds 



$>(n) = f F(y)dy + 0* (|||F'|| il(R) ) 

n6Z ^ R 



and 



and 



F(n)e(an) 



ne z 



||F|| L1(R) + i||F'|| L] 



(R) 



F(n)e(an) 



nez 



<r ii F fc II 

zsin to r 



/or a// natural numbers k ^ 1. 



(3.1; 



(3.2) 



(3.3) 



Proof. These bounds appear in [T3] and in [2HJ Lemma 1.1], but we reproduce the proof 
here for the reader's convenience. From the fundamental theorem of calculus one has 

tpn+l/2 
J \F'(t)\dt 

for all y G [n, n + 1/2] and 

F(n) = F(y) + <D*( T \F\t)\dt 

\</n-l/2 



1/2 

for all y G [n — 1/2, n]; averaging over y, we conclude that 

pn+1/2 / /-n+1/2 

F(n) = / F^dy+iO* / |F'(t)|dt 

Jn-l/2 \Jn~l/2 

Summing over n, we obtain ( 13. ip . Taking norms instead, one obtains 

£|F(n)| < ||i^|UMR) + III^IUmr) 



nez 



giving 



Now we obtain the k = 1 case of ( 13. 3p . We may assume q^O. By summation by parts, 
one has 

J2( F ( n + F{n))e{an) = (1 - e(-a)) ^ F(n)e{ 

nez nez 

and thus 



an 



F(n)e(an) 



nez 



2 sin(7ra; 



1 \J2(F(n + l)-F(n))e(an)\. 



(3.4) 



nez 
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Since \F(n+ 1) —F(ri)\ ^ Jq \F'(n + t)\ dt, we obtain the k = 1 case of (13. 3p . To obtain 
the higher cases, observe from ( I3.4|) . the fundamental theorem of calculus F(n + 1) — 
F(n) = Jq F'{n + 1) dt and Minkowski's inequality that 

| F{n)e{an) \ ^ J | ^ F'(n + t)e(on) | 

for some ^ t < 1. One can now deduce (I3.3P for general k from the /c = 1 case by 
induction. □ 

We can save a factor of two by restricting to odd numbers: 



Corollary 3.2. With the same hypotheses as Lemma \3. 1\ we have 
|2jF(n)e(an)l( 2 ,n)=i| < |||^|Ui(r) + §||-F'||li(r) 

and 

|^F W e(an)l (2 ,„ )ri |< ^L^ ||FW|U, w 



nGZ 

/or all k ^ 1 . 



Proof. We can rewrite 

^F(n)e(an)l (2 ,n)=i = e(a) ^F(2n + l)e(2an). 

n£Z nez 

Applying the previous lemma with a replaced by 2a, and F(x) replaced by F(2x + 1), 
we obtain the claim. □ 

We also record a continuous variant: 

Lemma 3.3. Let F : R — > C be a smooth, compactly supported function. Then one has 

\\F W \\»M 



F(y)e(ay) dy 



(27r|a|) / - 



for any k ^ and a G R. 



Proof. The claim is trivial for k = 0. For higher fc, we may assume a^O. By integration 
by parts, we have 

J F{y)e{ay) dy = J F'{y)e{ay) dy, 

and the claim then follows by induction. □ 

In order to sum the bounds arising from Corollary 13.21 (particularly when k — 1), the 
following lemma is useful. 
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Lemma 3.4 (Vinogradov-type lemma). Let a — - + (3 for some (3 = 0*(l/q 2 ). Then 
for any x < y, A, B > 0, and 9 G R/z , we have 



E 



min ( A 



B 



sin(7r cm + 



y-x 



+ 1 (2A + -BqhgAq). 

71 



Proof. We may normalise B = 1. By subdivision of the interval [x, y] it suffices to show 
that 

I 2 

V min{A,—- — -) ^2A + -glog4g 

' \ smliran + 6)\ ix 

x<n^x+q 1 v ' 1 

for all x. The claim then follows from [SJ Lemma 1]. (Strictly speaking, the phase shift 9 
is not present in the statement of that lemma, but the proof of the lemma is unchanged 
with that phase shift. Alternatively, one can perturb a to be irrational, and then by 
Kronecker's theorem one can obtain a dense set of phase shifts by shifting the interval 
[x, y] by an integer, and the claim for general 9 then follows by a limiting argument.) □ 

Once again, we can save a factor of two by restricting to odd n: 

Corollary 3.5 (Restricting to odd integers). Let 2a = | + (3 for some (3 = 0*(l/q 2 ). 
Then for any x < y, A, B > 0, and 9 G R/z, we have 



Yl min ( A > i - 7 ~ i mi ) \n,2)=l ^ 
^—^ \ | sin(7rcm -•- ' 



x<n^y 



2q 



+ 1 ) (2A+ -BqlogAq). 



Proof. Writing n = 2m + 1, the expression on the left-hand side is 



Emin \A, ; — | 
V I sin(27rom + ira + 9)\) 



0-l)/2<msg(j/-l)/2 

The claim then follows from Lemma 13.51 □ 



Now we turn to bilinear estimates. A key tool here is 

Lemma 3.6 (Large sieve inequality). Let£i, G R/z be such that — £j||r/z ^ $ 

for all 1 ^ i < j ^ R and some 5 > 0. Let I = [ATj, JV*a] be an interval of length 
\I\ = N2 — Ni ^ 1. Then we have 

R 

i=i ne/nz 

for all complex-valued sequences (a n )„ gz . 



Proof. See (30J Theorem 3] (noting that the number of integer points in / is at most 
|/| + 1). ' ' □ 

Specialising to £j that are consecutive multiples of a and applying the Cauchy-Schwarz 
inequality, we conclude 
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Corollary 3.7 (Special case of large sieve inequality). Let I, J C R be intervals of 
length at least 1, and let a G R/z . Then one has 

( 1 Y /2 
\y } a n b m e(nma)\ ^ + — —— \\(a n ) nSZ \\e 2 (z) || {b m )n£Z \\i 2 (z) 

nfl^Zmffnz V ^1K3<\J\\\MU/Zj 

for all complex-valued sequences (a n )„ ez , (6 m ) m6Z ■ 

Again, we can obtain a saving of a factor of 2 in the main term by restricting n, m to 
odd numbers: 

Corollary 3.8 (Restricting to odd numbers). Let I,JcRbe intervals of length at 
least 2, and let a G R/z. Then one has 

( 1 \ V2 

I ^ a " 1 M=l & m 1 (m,2)=l e ( nma )l ^ P' j ' + irif 114 • II IK a n)nGZ ||^(z)||(6 m ) nGZ 

ne/nz meJnz \ 1 i<K§|J| " 3 a " R / z J 



for all complex-valued sequences (a n ) ngz , (6 m ), 



mgZ 



Proof. We can rewrite the left-hand side as 



^2 ^2 a n b m e(4:nma)\ 



ne|/-inz me^J-inz 



where a n := a 2n +ie(2no;) and 6 m := b2 m +ie(2ma) , and the claim then follows from 
Corollary ED □ 

The above corollary is useful whenever the 4ja for 1 ^ j ^ || J| stay far away from the 
origin. When J is large, this is not always the case, but one can of course rectify this 
by a subdivision argument: 

Corollary 3.9 (Subdivision). Let I, J C R be intervals of length at least 2, and let 
a G R/z. Let M ^ 1. Then one has 



Yl a n\n,2)=ib m l( m ,2)=ie{nma)\ ^ + — l — — ) 



1/2 



J 

2M 



+ 1 



II 4 J"I|r/z 

1/2 

Il a n||^ 2 (z) || || £ 2 (Z ) - 



for all complex-valued sequences (a n ) nez , (6 m )mez ■ 



Proof. We subdivide J into intervals Ji, . . . , of length 2M, where /c := |_2l?J +1- The 
claim then follows by applying Corollary 13.81 to each sub-interval, summing, and using 
the Cauchy-Schwarz inequality. □ 
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4. Basic bounds on S Vtq (x,a) 
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In all the lemmas in this section, x ^ 1 is a real number, a G R/z is a frequency, 
r] : R — > R + is a bounded non- negative measurable function supported in [0,1], and 
q ^ 1 is a natural number. In some of the lemmas we will also make the additional 
hypothesis that rj is smooth, although in applications one can often relax this regularity 
requirement by a standard limiting argument. In some cases we will also need 77 to 
vanish near zero. 

The purpose of this section is to collect some basic estimates for manipulating the 
exponential sums 

S Vjq (x, a) := ^ A(n)e(an) l M=1 r](n/x) (4.1) 

n 

that already appeared in the introduction. 

We first make the easy observation that S v , q (x, a) barely depends on the parameter q. 
Lemma 4.1. We have 

S^qix,^ = S Vt i(x,a) + 0*(u(q)\\ri\\ LO o( R ) logx) 
In particular, if the largest prime factor of q is at most \fx, then 
S v , q {x, a) = S v ,x(x, a) + C*(2.52 v / x||?7|U°o(r)) 

In practice, we expect S Vtq (x, a) to be of size comparable to x, and so in practice the 
error terms here will be utterly negligible in applications, and we will be able to absorb 
them without any difficulty into a larger error term. 

Proof. We have 

\S v , q {x, a) - S v ,i{x,a)\ ^ |MU°° (R) E A ( ra )' 

Note that if A(n) is non-zero and (n, q) > 1, then n is a power of a prime p dividing q, 
thus 

n^x:(n,q)>l p\q j:pi^x 

Since Y\. 1 ^ the first claim follows. For the second claim, observe that 



;(g) ^ E 1 ^ 2 - 52: 



logx 

p^y/x 

where the last inequality follows from j43j Corollary 1]. □ 

Next, we make a simple summation by parts observation that allows one to replace rj 
by the sharply truncated cutoff l[ ,i] if desired. 
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Lemma 4.2. If rj is smooth, then one has 

\S m (x,a)\ ||r/|| L i (R) sup \S 1[01hq (y,a)\. 

Proof. Since rj{n/x) = — ^ J* rj'(y /x)l n ^ y dy for all n ^ x, we have 

1 f x 

S VtQ (x,a) = — / r)'(y/x) V" A(n)e(an)l n ^l (ri)? ) = i 
x Jo 



and thus 



\S m (x,a)\ ^ - / |V(y/a;)||5'i [0>1]jg (y,a)| dy, 



o 



X 

and the claim follows. □ 



We trivially have 

\S m (x,a)\ <3 Viq (x,0) (4.2) 
and we now consider the estimation of the quantity S^gfx, 0). 

Lemma 4.3. We have 

S m (x,0) ^ ||»7|U<»(r)S , i [Oi1] ,i(jc,0) ^ 1.04||r/|| L oc (R) a;. (4.3) 

If 7] is smooth and supported on [c, 1] for some c > with cx ^ 10 8 , then 

s v ,i{ x ,°) = \\v\\lhr)X + o* ( ^jpg^ ll^lU 1 ^) • ( 4 - 4 ) 

We remark that sharper estimates can be obtained using the machinery from Section 
U\ at least when rj is fairly smooth, though we will not need such improvements here. 

Proof. The first inequality of (14.31) is trivial, and the final inequality of f !4.3p follows from 
[43, Theorem 12] (indeed, one can replace the constant 1.04 with the slightly smaller 
1.03883). 

For (14.41) , we argue as in Lemma 14.21 and write 

S^O) = -- j X V '{y/x)Y,Kn) dy. 

X Jcx n^y 

From [44"1 Theorem 7] (and the hypothesis cx ^ 10 8 ) one has 



V A(n) =y + 0* ( — ^ ] 

^ K J y V401ogcx/ 

n^y v o / 
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and hence 

1 f x 

Sr,,i(x,o) = — / v'(y/ x )y dy 

J cx 

+ °*(-jkt — / W(y/x)\ydy 

\xA0\ogcx J cx 
Using the crude bound 

/•X 

I \v'(y/ x )\y d y < \W\\lhr)x 

J cx 

the claim follows. □ 
As A and rj are real, we have the self-adjointness symmetry 



3 v>q (x, -a) = S m (x, a). (4.5) 

Also, since e(n(a + 1/2)) = — e(na) when n is odd, we have the anti-symmetry 

Sr,, q {x, a + 1/2) = -Sr,, q {x,a) (4.6) 

whenever q is even. More generally, we have the following inequality of Montgomery 
EE: 



Lemma 4.4 (Montgomery's uncertainty principle). For any q dividing q, we have 

<K<?o) 



aGZ /qoZ:(a,qo) = l 



> T? \ \S v ,q{x,a)\ . 



Proof. We may of course take q to be square-free. Let a n := A(n)e(an)l( n!q )=ir)(n/x), 
S(-j^) '■— a n e(an/qo), and Z := X^n a «' t nen the inequality reads 



\s(— )i 2 > 1 



But this follows from [28] (particularly equation (10) and the final display in Section 3, 
and setting u{p) = 1 for all p\qo). Another proof of this inequality may be found in [19t 
Lemma 7.15]. □ 

Now we consider L 2 estimates on S Vjq (x,a). We have a global estimate: 
Lemma 4.5 (Global L 2 estimate). We have 

l \S Viq (x, a)\ 2 da ^ S v 2 iq (x, 0) logx. 
Jr/z 

Proof. From the Plancherel identity we have 

/ \S^ q {x,a)\ 2 da = r]{n/x) 2 A(n) 2 l {niq)=1 . 
Jr/z 

Bounding A(n) 2 ^ A(n) logx on the support of r](n/x) 2 , we obtain the claim. □ 
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We can largely removal the logarithmic factor in the above lemma by restricting to 
major arcs: 

Lemma 4.6 (Local L 2 estimate). Let Q,R ^ 1, and let E C R./Z be the set 

e - I I II r Q ° 1 Q ° i 1 i 

• U U L 2Q 2 R 2 'q ^ 2Q 2 R 2 ^ 

go^Q (a ,go)=l 

If R$\q, then 

JjS v , q (x,a)\ 2 da^m^y^S^ q (x,0). (4.7) 



Proof. From Lemma 14.41 one has 

All) 



^A- / l^, 9 (^ «) I 2 rf a < J] / 



Sr, n (x, a)\ 2 da 



for any g^. Summing over all qi ^ R coprime to QJj and rearranging, we obtain the 
bound 



\S„ n (x, a)\ 2 da < 



Sgi<K:(9l,Qtt)=l ^aiGZ/ gi Z:(a ll(?1 )=l Je+^L I Sr,,q (x, O.) | 2 da 



Set 



Z^ 9l <fl : ( gi ,Qj|)=l 



Observe that 



and thus 



i 




V , ™l G(R) 

2sqi^R:(qi,Q$) = l 0( 9l ) 



Also, from [26J or [3TJ Lemma 3] one has G(R) ^ logi? + 1.07 for R ^ 6, so by direct 
computation for 1 ^ R ^ 6 we have G(i?) ^ log-R for R ^ 1. 

To conclude the proof, it thus suffices (in view of Lemma 14.5)) to show that the sets 
E + ^ are disjoint up to measure zero sets as a 1; q 1 vary in the indicated range. Suppose 

this is not the case, then E + — and E + intersect in a positive measure set for some 
distinct a\,q\ and a' 1 ,q[ in the indicated range. Thus one has 

, CLq CL\ (2q (2^ , 1 



Qo Qi tfo Qi Q 2R2 



7 A version of this inequality was also obtained in an unpublished note of Heath-Brown, which was 
communicated to me by Harald Helfgott. 
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for some qo, q ^ Q with (a ,q ) = (a' ,q' ) = 1. The left-hand side is a fraction with 
denominator at most Q 2 R 2 , and therefore vanishes. Since qiq[ is coprime with q q' we 

a' 

conclude that — = -k contradiction. The claim follows. □ 

Remark. As pointed out by the anonymous referee, a slightly stronger version of Lemma 
14.61 (saving a factor of e 7 asymptotically) can also be established by modifying the proof 
of [5S1 Theorem 5]; the main idea is to decompose into Dirichlet characters, as in pQ. 

There are several effective bounds on the expression ^zt appearing i n Lemma 14.6} 

see [4"3{ Theorem 8], [10], [121 Theorem 6.12]. However, in this paper we will only need 
to work with the Q = 1 case, and so we will not use the above lemma here. Specialising 
Lemma [4.61 to the case Q — 1, we conclude 

Corollary 4.7. IfO < r < 1/2 and y/l/2r$\ q, one has 

I \S n>q (x,a)\ 2 da^- ^-^S^faO). 

J\\<*\\*/z<r 1 

We can complement this upper bound with a lower bound: 
Proposition 4.8. Let r\ be smooth and ^ r ^ 1/2. Then 



\S„„(x, a)\ da ^ 



(S v 2 jq (x, 0) - + VV"\\v(R)S v , q (x, 0)) 



2 



i- n,<A > / 1 ii n2 n /ii 

N| R/Z <r IMII^r)* + IIWIIiHR) 

Ignoring the error terms, this gives a lower bound of S v 2 q (x, 0), showing that Corollary 
14.71 is essentially sharp up to a factor of 2 when rx is not too large. 

Proof. From the Parseval formula, one has 

S v 2 >q (x, 0) = / S V)q (x, a)F(a) da 
Jr/z 

where F(a) := J2 n r)(n/x)e(— an). In particular, 

/ S v>q (x,a)F(a) da ^ S^^x, 0) — S Viq {x, 0) / |F(a)| da. 

^MIr/Z^ 1 " ^IHIr/Z^ 

and thus by the Cauchy-Schwarz inequality 

ln , , l2 , (^ 2 ,?(^°)-^( a; . )ilH| R/z >J i71 («)l rf «)+ 
D„„(a;, a) ao > 



||a||<r Jr/ z 

By the Plancherel theorem, one has 

\2 



/ \F{a)\ 2 da = S ^^rj{n/x) 

Jr/z 



and hence by (13. ip 

|F(a)| 2 da=||r / ||i 2(R) x + 0*(|| W '|U 1(R) ). 



r/z 
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Similarly, from (13. 3p (with k = 2) one has 

1 



for any a and thus 



' F ^^ 2,lsin M p l'^ + ^""^ 

\F(a)\da^°-?^\\riW + Vv"\W(K). 

<x\\<R/z>r nx 

Bounding 

cot(7rr) ^ — 
nr 

the claim follows. □ 
We can clean up the error terms as follows: 

Corollary 4.9. Let rj be smooth and supported on [c, 1] for some c > 0, and suppose 
that j- ^ r ^ 1/2 and q = \fx$. We normalise |M|l2( R ) = 1. Assume furthermore that 

cx ^ 10 8 (4.8) 

x^10 4 || W '|| l1(r) (4.9) 

log(cx) ^ 5|| W 'IUi(r) (4.10) 

* > 10 8 IMI1~( R) - (4-11) 

rx^20\\ V 'r ] ' + r ] r ] "\\ LHR) \\r ] \\ L ^ R) (4.12) 



Then one has 
and 



S v 2 jq (x, 0) = x{l + O*(0.02)) (4.13) 
/ \S v , q {x,a)\ 2 da ^ 0.94x (4.14) 

J\\a\\<r 



Proof. From Lemma [4.31 and ( I4.8p . ( I4.10p one has 

S v 2 A (x,0) =x(l + C*(0.01)) 
and by Lemma 14.11 and (14. lip we conclude (I4.13P . 

Let us denote the quantity J\\ a \\ <r \S v , q (x, a)\ 2 da by A. By Proposition 14. 8[ (I4.9p and 
Lemma [4.31 we have 

If 1 04 \ 2 

By (14~T2|) . fl4TT3|) we have 
1 04 

^IIW+WIUWMU~(r) o.ois»,o) 

and so 

A ^ 0.97-^ „(x, 0) 2 
x 

and thus by f l4~T3j) 

A ^ 0.94x, 
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which is QUID . □ 

The estimate in Corollary 14.71 is quite sharp when r is small (of size close to 1/x), but 
not when r is large. For this, we have an alternate estimate: 

Proposition 4.10 (Mesoscopic L 2 estimate). Suppose that q = \fx%. Let H 10 2 , and 
write D H (a) := J2h=i e (ha) for the Dirichlet-type kernel. Then we have 



'r/z 

where e is the quantity 



/ \S m {x,a)\ 2 \D H {a)\ 2 da ^ (1 + e) x 8# 2 x||r7||| 
Jr/z 



0.131ogx (eno g \og(2H) + ^y ) )\o g (9H) 

H 2H ' 1 ' ' 



By applying this proposition with H slightly larger than log x and using standard lower 
bounds on \D H (a)\, we conclude that 

/ \S v , q (x, a)\ 2 da ^ (8 + o(l))x\\rj\\ 2 LOO(R) . 

In comparison, Corollary 14. 71 gives an upper bound of (2 + o(l)) ^° f^ gx x\\r] \\ 2 2 / R ) for this 
integral, while Prop osit ion 14 . 8 1 gives a lower bound of (1 — o(l))a;||?y||^2( R ) (if V is smooth). 
Thus we see that the bound in Proposition 14.101 is only off by a factor of 8 or so, if rj is 
close to l[o,ii- It seems of interest to find efficient variants of this proposition in which 
the \Dh{®)\ 2 weight is replaced by a weight concentrated on multiple major arcs, as in 
Lemma I4.6[ as this may be of use in further work on Goldbach-type problems. 



Proof. We may normalise 1 1 ?7 1 ] (r) = 1. By the Plancherel identity, we may write the 
left-hand side as 

H H 

^2 A ( n )ll( n /x)l(n,q) = lH n + h ' ~ h)7]{(n + h! - h)/x)l(r,+h'-h,q)=l- 

h=l h'=l n 

The diagonal contribution h — h! can be bounded by 



H A (71)77(71/2;) log 



X 



which by Lemma 14.31 is bounded by 1.04iJxlogx. Now we consider the off-diagonal 
contribution h 7^ h'. As A(n) l( n ,g)=i is supported on the odd primes p less than or 
equal to x, we restrict attention to the contribution when h — h' is even, and can bound 
this contribution by 

H 

log 2 x\{p ^ x : p + h — h! prime, p + h — h' ^ x}\, 

h'=l l^h^H-.hjth'^h-h 1 

which by the pigeonhole principle is bounded by 

H log 2 x||j» ^ x : p + h — h' prime, p + h — h' ^ x}\, (4.16) 

lsih^H:h^h';2\h-h' 
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for some 1 ^ h! ^ H, which we now fix. Applying the main result of Siebert [45J, we 
have 

\{p ^ x : p + h — h' prime, p + h — h' ^ x}\ ^ 8&I2 — 5 — TT ~ 

log x , 1 - p — 2 

6 p\h-h';p>2 y 

where & 2 '■— 2^Q p>2 (l — t^zjw) is the twin prime constant. We can thus bound (I4.16P 

£ n ^ 

l^h^H:h^h';2\h-h' p\h-h';p>2 F 

If we let / be the multiplicative function 

f(n) := l M=1 fi\n) JJ 

p\n:p>2 

then we have 

n e /w 

p\h-h';p>2 F l^n^H;(n,2)=l:n\h-h' 

whenever 1 ^ h, h' ^ H with h ^ h', so we may bound the preceding expression by 

l<n<H:(n,2)=l l^h^H:2n\h-h' 

We may bound 

E + 1 (4 - 17) 

l^/ts£H:2n|/i-/i' 

so that (I4.16P is then bounded by 

8e ^'("Ei?+ E /(») 

\ n=l lCnsg/f 

By evaluating the Euler product one sees that 

£M = iII (i + /<P)) = £. 

n=l p 

To evaluate the / summation, we observe 



= ^ AlL^ n ( x 



We can bound ^ ^ 

i II (^zip) 1 < 

p\n>2 v ^ ; 



and from [121 Theorem 15] one has 



2n <r 7 1 Wo ^ 2 - 507 
^ e r loglog(2n) 



0(2n) ^ loglog(2n)' 



Alternatively, one can sum / directly by using effective bounds on sums of multiplicative functions, 
as in |35j . This will save a factor of log log .ff in the upper bounds, but in our applications this loss is 
quite manageable. 



EVERY ODD NUMBER GREATER THAN 1 IS THE SUM OF AT MOST FIVE PRIMES 23 

Since n ^ H and H 10 2 , we conclude that 

— — < e 7 log log (2H) + - — 

0(2nJ log log (2//) 

for any 1 ^ n ^ H with (n, 2) = 1 (the cases when n is so small that e 7 loglog(2n) + 
iog 2 io 5 g °(L) can exceed e 7 loglog(2if) + log i 5 g ° ( 2 g) , and specifically when n = 1, 3, 5, can be 
verified by hand). Thus we may bound 

Since 0(n) = 0(2n) when n is odd, we have 

^ <A(n) ^ 2 ^ 
and hence by [35J Lemma 3.5], one has 

£ ^ §(log(2#) + 1.4709) ^ | log(9if). 

Combining all these estimates we obtain the claim. □ 

Remark. As observed by the anonymous referee, when H is an integer, the second 
term in (14.15P may be deleted by using [361 Lemma 5.1] as a substitute for (14.17P (and 
retaining the sum over h', rather than working only with the worst-case h'). 



To deal with S rj q (x,oi) on minor arcs, we follow the standard approach of Vinogradov 
by decomposing this expression into linear (or "Type I") and bilinear (or "Type II") 
sums. We will take advantage of the following variant of Vaughan's identity [46] , which 
we formulate as follows: 

Lemma 4.11 (A variant of Vaughan's identity). Let U, V ^ 1. Then for any function 
F : Z — > C supported on the interval (V, UV 2 ), one has 

\J2Mn)F(n)\ ^Tj + Tjj 

n 

where Tj is the Type I simf] 

Tj:= |£(logn + c d logd)F(dn)| 

d^UV n 

for some complex coefficients (depending on F) with |q| ^ 1, and T/j is the Type II 
sum, 

d>Uw>V 

where g(w) is the function 

b\w:b>V 

9 Strictly speaking, Tj is an average of Type I sums, rather than a single Type I sum; however we 
shall abuse notation and informally refer to Tj as a Type I sum. 
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This differs slightly from the standard formulation of Vaughan's identity, as we have 
subtracted a factor of |logu> from the coefficient g(w) of the Type II sum. This has 
the effect of improving the Type II sum by a factor of two, with only a negligible cost 
to the (less important) Type I term. 

Proof. We split ji = fil^u + /-*!>£/ and A = Al^y + Al>y, where l^u(n) := l n ^u, and 
similarly for l>u, l<y, l>y- We then have 
A = fi * A * 1 

= fxl^u * A * 1 — /il^t/ * Al^y * 1 + fil>u * Al>v * 1 + /j, * Al^y * 1 (4.18) 

= fil^u * log —pLl^u * Al^v * 1 + /il>{/ * Al>v * 1 + Al^y 

We sum this against F, noting that F vanishes on the support of the final term Al^y, 
to conclude that 

J>(n)F(n) = 5>(d)J>>gn)F(dn) 

n d^U n 

- E f(d)J2 F (dn) 

d^UV n 

+ E E ^ d )(9(w) + \ log w)F(dw) 

d>Uw>V 

where 

f(d) := )A(&)- 

b\d:d/U^V 

Observe that when d > U and w > V, then the term [i{d)(\ \ogw)F(dw) vanishes unless 
U < d ^ UV. We may thus bound 

\J2Mn)F(n)\= \J2( lo ^ F ( dn ^ 

n dsiUV n 

+ E \f(d)\\J2 F ( dn )\ 

d^UV n 

+ iEEm%h^)i- 

d>Uw>V 

Note that 

|/(d)|^5>(&)=logd. 
b\d 

Since 

| ^(logn)F(dra)| + \Y,F{dn)\ Xogd = | ^(logn + c d \ogd)F(dn)\ 

n n n 

for some complex constant c<j with \c^\ = 1, we obtain the claim. □ 
Note that as 

(k E A(&)^E A ( 6 ) = 1 °g w 

b\w.b>V b\w 

we have the bound 

\g(w)\ ^ \logw. (4.19) 
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5. Minor arcs 



We now prove the following exponential sum estimate, which will then be used in the 
next section to derive Theorem 11.31 

Theorem 5.1 (Bound for minor arc sums). Let 4a = - + /3 for some natural number 
q ^ 4 with (a, q) — 1 and some (3 = 0*{l/q 2 ). Let 1 < U, V < x, and suppose that we 
have the hypotheses 

UV ^ x/4 (5.1) 
UV 2 > x (5.2) 
U, V ^ 40. (5.3) 

Then one has 

r I '1TJV \ f 5 

i-(logz) log I —— + 4j + 0.89 \UV + -t 



\Sr,, 2 (x, a) | < 0.5-(logx)log ( — +4 J + 0.89 ( UV + -q ) (8 + logg) log(2x) (5.4) 



+ |ai 5f + a39 ^) (log w )log ^ (55) 

+ 0.55-= + 0.78-= log-. 5.6 

x Vu VvJ u 



Furthermore, if a = ±1 and UV < q — 1, then we may replace the term (15.41) in the 
above estimate with 

96 x , , A . , 4eg . _. 

~2 / / ^ 2 log(4x)log . (5.7) 

We now prove Theorem 15.11 By Lemma [4.111 we have 

\Sno,2{x, 1)| < T/ + T n 

where Tj is the Type I sum 

Tj := | (logn + Cdlogd)e(adn)rjo(dn/x)\ 

d<£/V:(d,2)=l n:(n,2)=l 

and Tjj is the Type II sum 

T H := | 2^ l^(d)g(w)e(adw)r] (dw/x)\. 

d>U:(d,2)=l w>V:(w,2)=l 

5.2. Estimation of the Type I sum. We first bound the Type I sum Tj. We begin 
by estimating a single summand 

| ^(logra + Cd\ogd)e(dna)T]o(dn/x)1^2)=i\. (5.8) 

n 

By Corollary 13.21 we may bound this expression by 

•AllCll , HIP' II II-F'IU+r) II-F"IUi(r) \ 

\/ w 2 w 2| sm(2nda)\ 2 1 sm(27r act) | 2 / 
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where F(y) = F d (y) := rj (dy/x)(logy + c d \ogd). We have 

F\y) = -rj' (dy/x)(\ogy + c d logd) + H^^lM 
x y 

and 

ei/// \ d2 ///j /vi , i j\ , n d rf {dy/x) Vojdy/x) 

F {y) = -^r] (dy/x)(\ogy + c d logd) + 2 . 

x A x y y z 



Strictly speaking, F is not infinitely smooth, and so Corollary 13.21 cannot be directly 
applied; however, we may perform the standard procedure of mollifying r)o (and thus 
F) by an infinitesimal amount in order to make all derivatives here well-defined, and 
then performing a limiting argument. Rather than present this routine argument ex- 
plicitly, we shall abuse notation and assume that 770 has been infinitesimally mollified 
(alternatively, one can interpret the derivatives here in the sense of distributions, and 
replace the L l norm by the total variation norm in the event that the expression inside 
the norm becomes a signed measure instead of an absolutely integrable function). 

Since d ^ U and 77 is supported on [1/4,1], r)(dy/x) is supported on [x/4d,x/d]. In 
particular, | logy + c d \ogd\ ^ \ogx. We thus have 

x 

\\F\\l^(k) ^ ||77o||Li(R)^logX 

H-F'IUi(R) ^ lholU 1 (R) lo S x + IM|L->(R)log4 
II^'IIlhr) ^ ||77o||i,i( R )^logx + 2||77o|U°o (R) ^log4+ |ho|U-( R )^- 
Routine computations using (jl.7p show that 



H^olU^R) = 1 


(5.9) 


MU°°(r) = 41og2 


(5.10) 


K|| Ll(R) = 81og2 


(5.11) 


holU-(R) = 16 


(5.12) 


K'IIli(r) = 48. 


(5.13) 



Note that H^Hl^r) an d ||7/o' IU X ( R ) can a ^ so ^ e interpreted as the total variation of the 
functions 770 and 7] ! respectively, which may be an easier computation (especially since 
i]q is not a function before mollification, but is merely a signed measure). Inserting 
these bounds, we conclude that 

x 

II^IUi(R) ^ d l ° gX 

\\F'\\ L i iR) <8(log2)log2a; 

||F'|| L i (R) ^48-log4x. 

x 

We conclude that 

_ fix, x, 4(log2)log2x d 24log4x 

< £_ -on [h l"g * + 4dog 2) log 2 X , , -j-^^ 

(5.14) 
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We first control the contribution to (15.141) when d ^ q/2. In this case, d is not divisible 
by q; as (a,q) = 1, this implies that ad is not divisible by q either. We therefore have 

1 _9/2 = 1 
q q 2 2q 

and thus 



||4efa|| R/z > ||ad/g|| R/z - d\/3\ > - - \ = — (5.15) 



< . . . 7- .. < 2q (5.16) 



| sin(27rda)| ~ |sin(7r/4g)| 
thanks to ( 12. ip . Thus contribution here (I5.14p of the d ^ q/2 terms are bounded by 



4(log2)log2x V l (d 2) min(2g, , 1 



sin(27rda)|* 



By Corollary 13.51 one has 



V l( d)2 ) min(2g, , ) ^ -glog4g + 4g 

so by symmetry we may thus bound the contribution of the d ^ q/2 terms to (15.141) by 

2 

2 log 2 log 2a; {—q log 4g + 4g). 

7T 

Now consider the contribution to (15.141) of a block of the form 2jg + | < d ^ 2(j + l)g + | 
for a natural number j with j ^ — ~. This contribution can be bounded by 

E. A * , „m „ 4(log2) log2a;\ 

^ l0SI + 4<l0g2) '° g21 ' S in(2 T ^)| J ' 



which by Corollary 13.51 is bounded by 



_ log a; + 8 (log 2) log 2x + 4 (log 2) log 2x—q log 4g 



which we can crudely bound by 

x 2 

- \ogx + 4(log2) log2x(— glog4g + 4g). 

2j<? + | vr 

Combining all the contributions to (j5.14j) . we obtain the bound 

+ (-7T- + T)(-? 1 og4g + 4g) x 4(log2)log2x. 
2g 4 7r 



By the integral test, one has 



^ 1 rUV+2q x 

2q 4 

= — log + 4) 

2g g 
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and so 

x 2UV UV 5 2 

Tj < — log( +4) logx + (— — + -)(-glog4g + 4g) x 4(log2) log2x. 

We can write 

2 2 

— g log 4g + Aq ^ — g(8 + log q) 

71 7T 

and 

x-x 41og2 0.89 

1 7T 

and so 

£ 2[/y 5 

T 7 < 0.5- log( + 4) logo; + 0.89CUV + -q)(8 + \ogq) hgx (5.17) 

q q 2 

which gives the term (j5.4p . 

Now suppose that a = ±1 and UV < q — 1. By the symmetry (14. 5 p we may take a — 1, 
thus a = ^ + ^ 4( g -i) • l n particular, for d ^ [/V one has d ^ q — 2 and 

therefore < 27r<ia < tt/2. In particular, we have 



sin(27T(ia) ^ sin 
From (I5.14p . we conclude that 



2nd 



4(9-1) 



24 log 4x ^4 2 Trrf 

i/ ^ > a cosec 



x ^ ^ 2(g-l)- 

The function d ^ d cosec 2 2 (q-i) * s convex on [0, 7r] (indeed, both factors are already 
convex), and so by the trapezoid rule 

241og4x H~ 3 / 2 2 Try 

T/ ^ / j/cosec — — dy. 

x A/a 2(g - 1) 

Using the anti-derivative 

ycosec 2 y dy = log | siny| — y cot y 



/ 



we can evaluate the right-hand side as 

/ 2 (g- l), 2 241og4x vi„=w/2-»r/4( ff -l) 

( ) ~ (log | sui 2/| - y cot y) \ v y J Kq l^ q 

The function ycoty varies between and 1 on [0,7r/2], and thus 

m ,2(g-l) N ,241og4x„ tt 

Tj < — 2 — logcot + 1 

ii x 4{q — 1) 

which, on bounding cot - ^ < *SSz2l ^ M. anc i Hi£zll < ^2 gj ves 



7V<^-^log(4x)log(^) (5.1* 



which is the alternate contribution (15. TJ . 
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5.3. Estimation of the Type II sum. We now control T/j. From f ll.Sp and the 

triangle inequality, we thus have 

T n ^4 J F(W) W (5.19) 

where 

F ( w ) \ ^( d )Md,2)=i\x/2W,x/w](d)g(w)l^ i2 )=il[w/2,w](w)e(adw)\. 

d>Uw>V 

Observe that F(W) vanishes unless 

V < W < ^, (5.20) 
and we henceforth restrict attention to this regime. 

From ( jgjZDD , O we see that the intervals [x/2W, x/W], [W/2,W] both have length 
at least 2. We now apply Corollary 13.91 with M := q/2 and use (14. 19j) to bound 

^o^0t + j) I/2 ( l ^ j + 1 ) 1/2a1/2bI/2 (5 ' 21) 

wher<£B 

5 := inf ||4ja|| R / z 



1 



(d,2)=l 



d£[x/2W,x/W] 

B := ^2 1 («',2)=ilog 2 w. 
iue[W/2,W] 

From the computation ( I5.15P we have 



8 > — . (5.22) 



2g 

Now we bound A and 5. Clearly we have 

x 

Similarly, for £? we bound log 2 u> by log 2 W 7 , and conclude that 

W 

\B\ ^ ( T + l)log 2 W 

By fl5T2TJ|) we thus have 

1 1 ^ 4 W 

ISI ^ — W\oz 2 W 



10 One could achieve a very slight improvement to the A factor by exploiting the fact that /z 2 (n) 
vanishes when n is divisible by an odd square, but we will not do so here to keep the exposition simple. 
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and thus by fICT]) . flB^j) 

Fiw) < T ( T + 2g)1/2( 2^ + 1)1/V/2logW/ - 

Crudely bounding (a + fe) 1 / 2 ^ a 1 / 2 + fe 1 / 2 we thus have 

F{W) ^ ^(^^ + \"SxW + "i^= + ^^=r) logW. 

We integrate this expression over (I5.20p against the measure to bound (I5.19p . To 
simplify the computations slightly at the cost of a slight degradation of the numerical 
constants, we bound log W crudely by log jj for the middle two terms for F(W). Since 

dW , x , Vx 
2 log — — log 



we thus obtain the bound 




_ ^ 1.1 / 1 x r- x \ . x . Vx 
Tit ^ — — = h V2 - log log 

Combining this with (I5.17P and (15.181) we obtain the claim. 

Remark. When /3 is very small (of size 0(1/ x) or so), one can eliminate most of the 
0.5~logaTog(^ + 4) term in (I5~3j) by working with S Vjq (x,a) - ^-S VtQ (x, 0) in- 
stead of S v> 2(x, a), which effectively replaces the phase e(an) with the variant phase 

q ) =1 . The main purpose of this replacement is to delete the 
non-cancellative component of the Type I sum (which, for small values of (3, occurs when 
d is divisible by q), which would otherwise generate the |log 2 a; type term in (ll.lOp . 
Such an improvement may be useful in subsequent work on Goldbach-type problems, 
but we will not pursue it further herd* 2 !. 



6. Proof of Theorem 11.31 



In this section we use Theorem 15.11 to establish Theorem II. 3[ basically by applying 
specific values of U and V. With more effort, one could optimise in U and V more 
carefully than is done below, which would improve the numerical values in Theorem 11.31 
by a modest amount, but we will not do so here to keep the exposition as simple as 
possible. 

We begin with (II. 9p . We apply Theorem 15. II with 

U := -x 2 ^ ,V := \x 2 '\ 

A 1 



"By avoiding this step, one could save a modest amount in the numerical constants in the estimates, 
but at the cost of a more complicated argument. 

12 For very small values of q, it is likely that one should instead proceed using different identities than 
Vaughan-type identities, for instance by performing an expansion into Dirichlet characters instead. 
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As x ^ 10 20 , the hypotheses of the theorem are obeyed, and we conclude that 

\S v , 2 (x, a) | ^ 0.5- logx log(x 4/5 ) + 0.89(^x 4/5 + -q) (8 + log q) log 2x 
q 8 2 

+ (0.1— + 0.39^=) log(8x 1/5 ) log(2x) 
Vq JxTq 

+ 2.3x 4/5 log(2x 3/5 ). 
As x ^ 10 20 and q ^ x/100, one can compute that 

log(8x 1/5 ) log(2x) ^ ilogx(logx+ 11.3) 
5 

(8 + log q) (log 2x) ^ 1.1 logx(logx + 11.3) 

log(2s 3/5 ) ^ 0.011 log x(log x +11.3); 
inserting these bounds and collecting terms, we conclude that 
\S, n2 (x, or) I < (0.4- + 0.1— + 2.45^- + 0.39^L= + 0.149x 4/5 ) logx(logx + 11.3). 

q x /i \Mq 

Since 100 ^ q ^ x/100, one has 

x x 
0.4- ^ 0.04 — 

and 

x x 
2.45— ^ 0.245 



x/q yfxjq 

and the claim (11. 9p follows, after using Lemma H~T1 to replace S V:2 (x, a) with S vm (x,a) 
(at the cost of replacing the 0.149x 4//5 term with 0.15x 4 / 5 ). 

Now we verify (11.101) . Here we use the choice 

x 

U:=-- V:=q. 
T 

The hypotheses of Theorem 15.11 are easily verified, and we conclude that 

\S v ,2(x, a)\ ^ 0.5- log:rlog( — + 4) + 0.89(- + -q)(8 + logg) log(2x) 
q q z q 2 

+ (0.1^ + 0.39^=) x 31og 2 g 
JxTq 

+ (0.55 * + 0.78—) x 2 log q. 



Since q ^ x 1 ^ 3 and x ^ 10 20 , one has 



./• x 



Vx7¥ " v^ 

and 



' 0.00l4: 



^Jxfq " v^ 

and 

q ^ 0.001- 

Q 
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and so 

x 2x 
\Sr,,2{x, a)K - log(2a;)[0.5 log( — + 4) + 0.9(8 + logg)] 



+ (0.301 log 2 q + 2.66 logg) . 



Since 



,2x 



0.5 log(— + 4) + 0.9(8 + log q) «C 0.5(log(2x + 4g 2 ) + 14.4) 

0.5(log(2ar) + 15) 

(using the hypotheses q ^ x 1 ^ 3 and x ^ 10 20 ) and 

0.301 log 2 q + 2.66 logg ^ 0.301 log g(logg + 8.9) 

we obtain the claim fll.lOp . after using Lemma PO to replace S V) 2(x,a) with S Vtqo (x,a) 
(and replacing 0.301 with 0.31). 

Now we prove p. lip . Here we use the choice 

x 

U := ; V := x/q. 

[x/q) 2 

Again, the hypotheses of Theorem 15.11 are easily verified, and we conclude that 

X T 

\S v , 2 (x, a) | ^ 0.5- log x log 6 + 0.89(-g) (8 + log q) \og{2x) 
q I 

ry rf rf 

+ (0.1^ + 0.39^=) x 3 log 2 - 
y/q y/x/q q 

+ (0.55 X = + 0.78—^=) x 2 log -. 

\Jx/(x/q) 2 y/x/q Q 



Since q ^ x 2 ^ 3 and x 10 20 , one has 



and 



and 



\fx/(x/q) 2 y/x/q 



' < 0.001- ' 



- < O.OOlg 
9 



and thus 



|^, 2 (x,a)| ^ 3.12g(8 + logg)log(2x 

+ (1.181 log 2 - + 2.66 log X 



q y/x/q 



Writing 



1.181 log 2 - + 2.66 log - 1.181 log -(log - + 2.3) 
q q q q 



we obtain the claim (II. lip , after using Lemma I4TT1 to replace S^ )2 (x, a) with S mo (x,a) 
(and replacing 1.181 with 1.19). 
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Finally, we establish (j!.12p . Here we use the choice 

to ensure that UV < q — 1. We may now use the term (15. 7p and conclude that 
\S V)2 (x,a)\ < ^i ( X ,, 2 log(4a;) log — 

7T Z (x/q) Z 7T 



+ (0.1 — + 0.39^=) x 3 log 2 (1.02x/g) 



We can bound 



(0.55 ; X +0.78 ; X ) x 21og(1.02z/o). 



Acq 

log ^ log(x/4) 

7T 



and hence 

log(4a;) log(x/4) ^ log 2 x. 
Also, as q ^ x 1 ^ 3 and x ^ 10 20 , onehas 



x x 
< 1.02- 



y/x/{1.02x/q) 2 ^Jxfq 



and 



and thus 



' < 0.001- ' 



y/q JxTq 
|6,, 2 (x,a)| ^ — 7777777 log x 



7r 2 (x/qY 
+ 1.19— ^=log 2 (1.02x/g) 

+ 2.67— ^=log(1.02x/g). 

The last two terms can be bounded by (1.19 log x (log + 2.3). Since 

, 1.02a; , x 

1.19 log < 1.2 log - 

q q 

and 

, l.Q2x x 

log + 2.3 ^ log - + 2.35 

q q 



and 

96 



7T 2 



< 9.73 



the claim (I1.12p follows, after using Lemma I4TT1 to replace S Vt 2(x,a) with S vm (x,a) 
(and replacing 2.35 with 2.4). 
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7. Major arc estimate 



In this section we use Theorem 11.51 to control exponential sums in the "major arc" 
regime a = 0{Tq/x). To link the von Mangoldt function to the zeroes of the zeta 
function we use the following standard identity: 

Proposition 7.1 (Von Mangoldt explicit formula). Let rj : R — > C be a smooth, com- 
pactly supported function supported in [2, +oo). Then 

J2Hn)v(n) = /(i - -s^-Hv) dy-J2 [ v(y)y p ^ dy (7.1) 

Jr x x p j R 

where p ranges over the non-trivial zeroes of the Riemann zeta function. 



Proof See, for instance, [19j §5.5]. □ 

Proposition 7.2 (Major arc sums). Let To be as in Theorem \1.5l Let rj : R — > R + be a 

smooth non-negative function supported on [c, d\, and let x,a G R be such that cx ^ 10 3 
and 

\a\ ^ -r^-. (7.2) 

Atcc'x 

Then 

f 1 T 

\S v ,i(x,a)-x / r)(y)e(axy) dy\ *C A^-^x + 2.01c" 1/ V /2 iV(To) ||r/|| L i (R) , (7.3) 

where A is the quantity 

A := 60|M| L1(R) + 32c , ||r/|| L1(R) + 4(c , ) 2 h // ||Li(R) (7.4) 
and N(Tq) is the number of zeroes of ( in the strip {0 ^ 3?(s) ^ 1; ^ S(s) ^ T }. 



As should be clear from the proof, the constant A can be improved somewhat by a 
more careful argument, for instance by exploiting the functional equation for (, which 
places at most half of the zeroes to the right of the critical line 9ftp =1/2. However, this 
does not end up being of much significance, as the right-hand side of (17. 3p is already 
small enough for our purposes (thanks to the large denominator of T ). It may appear 
odd that the condition (17. 2p allows for |a| to exceed 1, but the estimate (17. 3p becomes 
weaker than the trivial bound in Lemma 14.31 in this case. 

For our particular choice of To, namely To := 3.29 x 10 9 , the quantity N(Tq) can be 
bounded by 10 10 (indeed, the value of To we have chosen arose from the verification 
that the first 10 10 zeroes of ( were on the critical line). 

Proof. We have 

S Vt i(x, a) = 2^ A(n)r)(n/x)e(an). 

n 

Applying (17. ip . we conclude that the left-hand side of (17. 3p is bounded by 
/ -3^ — v(vM dy + Y" 

J R y 3 -y *-? 




i](y/x)e(ay)y p 1 dy 
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Crudely bounding by on the support of r](y/x), the first term is at most 

1(H)- 



2c 3 x 2 ||?7||l 1 (r)- Now we turn to the second sum. We write p = o + it with < a < 1 



and t G R. 

We first consider the contribution of those zeroes with \t\ ^ To. By hypothesis, a = 1/2 
for these zeroes, and we may bound this portion of the sum by 

/ v(y/x)y~ 1/2 d y 

p:\t\<T Jr 

which we may bound in turn by 

2c- 1 / 2 x 1 /2 iV(To) || r/ || L1(R) _ 

As To ^ 10 3 and cx ^ 10 3 , we may clearly absorb the much smaller error term 
2c~ 3 s _2 ||?7||ii( R ) into this term by increasing the 2 factor to 2.01. 

Finally, we consider the terms with \t\ > T . We may rewrite a single integral 

I / V(y 'I / x)e(ay)y p ~ 1 dy\ (7.5) 
Jr 

as 



/ f(y)e i( - 2nay+noey) dy\ 
Jr 



where f(y) := r](y/x)y a x . Since 

e i(2nay+tlogy) _ \ ^_^^ 2na y + t log y) 

i(2ira + t/y) dy 

with the denominator non- vanishing on the support of rj(y/x) thanks to (17. 2p . we may 
integrate by parts and write the preceding integral as 

(/ 1 ' f(y) j e^v+t^y) dy\. 



dy \27ra +t/y* 

A second integration by parts then rewrites the above expression as 
d ( 1 d ( 1 



R 



dy \2ira + t/y dy \2ira + t/y" 
which we then bound by 

d ( 1 d ( 1 



f(y) ) ) e i{2nay+tlogy) dy\ 



R 



dy \2ira + t/y dy \2ira + t/y 
The expression inside the absolute value can be expanded as 

t 2 /y A - Anat/y 3 



f(y) I dy. 



(2na + t/y) 



■i 



-f(y) 



, Zt/y 2 
(2ira + t/y) 3J {V) 

+ (2na + t/y) 2f 
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On the support of / one has y ^ c'x. From (17. 2p we may thus lower bound 

1*1 



and upper bound 



\2ira + t/y\ ^ — 
1 /yi 2y 



\e/y* - ATtat/y*\ < 



and so the preceding integral may be bounded by 

32 



\f{y)\dy 

24 f 

+ ¥ y\f(y)\dy 

J R 

+ i / y 2 \f"(y)\dy. 



t 2 „ 

Since ^ a ^ 1, we have the crude bounds 

\f(y)\ < v(y/ x ) 

\f'(v)\ < -W(y/x)\ + ~7](y/x) 
x y 

12 1 

\f"(y)\ < -^W'(v/ x )\ + —W(y/ x )\ + -^v{y/ x ) 

x z xy y 1 

and so (on bounding y from above by c'x) we may bound (17.51) by 

A- 

where A is the quantity defined in (I7.4p . From jlOl Lemma 2], we may bound 

1 1 „ T s 1.34, , T 

p:\t\>T U 

which under the hypothesis T ^ 10 3 can be bounded above by 

loglo 
3T ' 

The claim then follows. □ 



8. Sums of five primes 



In this section we establish Theorem 11.41 We recall the following result of Ramare and 
Saouter: 

Theorem 8.1. If x is 1.1 x 10 10 is a real number, then there is at least one prime p ^ x 
withx-p^ 
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Proof. See [40 , Theorem 3] . One of the main tools used in the proof is Theorem 11.51 
By using the more recent numerical verifications of the Riemann hypothesis, one can 
improve this result; see JTH]. However, we will not need to do so here. □ 

By combining the above result with Theorem 11.61 we see that 

• Every odd number between 3 and (2.8 x 10 7 )iVo is the sum of at most three 
primes. 

• Every even number between 2 and (2.8 x 10 7 ) 2 A^ is the sum of at most four 
primes. 

• Every odd number between 3 and (2.8 x 10 7 ) 3 A^ is the sum of at most five 
primes. 

In particular, Theorem Ol holds up to (2.8 x 10 7 ) 3 iVo > 8.7 x 10 36 . Thus, it suffices to 
verify Theorem 11.41 for odd numbers larger than 8.7 x 10 36 . 

On the other hand, in [24J it is shown that every odd number larger than exp(3100) is 
the sum of three primes. It will then suffice to show that 

Theorem 8.2. Let 8.7 x 10 36 ^ x ^ exp(3100) be an integer. Then there is an integer 
in the interval [x — Nq, x — 2] which is the sum of three odd primes. 



We establish this theorem by the circle method. Fix x as above. We will need a 
symmetric, Lipschitz, L 2 -normalised cutoff function rji which is close to l[o,i]- For sake 
of concreteness we will take 

Vl (t) :=(l-10dist(t, [0.2,0.8]))+, 

which is supported on [0.1, 0.9] and obeys the symmetry 

7^(1 -t) = r) X {t) for all t G R. (8.1) 

For future reference we record some norms on 771 (after performing an infinitesimal 
mollification): 







IMU 2 (r) = y \ 


(8.2) 


1 L°°(R) = 1 


(8.3) 


II 1 7 


(8.4) 




IKIU~(r) = 10 


(8.5) 


IKIUhr) = 2 


(8.6) 


IMllki(R) = 1 


(8.7) 


IKIUhr) = 40 


(8.8) 


vWi + ViVi\\l^(r) = 40. 


(8.9) 
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For a parameter K ^ 1 to be chosen later, we will consider the quantitjO 

Y Y H n i) 1 (n 1 ,^t)=iVi(ni/x)A(n 2 )l {n2 ^ )=1 r] 1 ( y n2/x) 

niwMl^hxMMi^No/S (8.10) 

x A(n 3 ) l^^^jj^^ =1 r] (Kn 3 / x) l x=ni+n2+na+hl+h2+h;i 

Observe that the summand is only non-zero when ni,n 2 ,n 3 are odd primes (of magni- 
tudes less than x, x, and x/K respectively) with ri\ + n 2 + n 3 G [x — N , x — 2]. Thus, to 
prove Theorem I8.2[ it suffices to show that the quantity (I8.10P is strictly positive. Ap- 
plying the prime number heuristic A ps 1 and (18. 2p . we see that we expect the quantity 
(1810|) to be of size roughly lx^K- l (N Q /?>f. 

Note that we have made the third prime n 3 somewhat smaller than the other two. 
This trick, originally due to Bourgain [3], helps remove some losses associated to the 
convolution of the cutoff functions rji,r] . 

By Fourier analysis, we may rewrite (I8.10p as 

J S m ^ i (x,a) 2 S T)0t ^ Rt (x/K,a)D No/3 (a) 3 e(-xa) da (8.11) 



'r/z 

where D^ / 2 (a) is the Dirichlet-type kernel 

d Nq/2 {u) ■■= Y e ( nc 0- 

It thus suffices to show that the expression is non-zero for some K. It turns out that 
there is some gain to be obtained (of about an order of magnitude) in the weakly minor 
arc regime when a is slightly larger than T /x, by integrating K over an interval inside 
[1,T ] that is well separated from both endpoints; for instance, one could integrate 
K from 10 3 to 10 6 . Such a gain could be useful for further work on Goldbach type 
problems; however, for the problem at hand, it will be sufficient to select a single value 
of K, namely 

K : = 10 3 . 

As mentioned previously, the expected size of the quantity (18.111) is roughly |^(iVo/3) 3 . 
This heuristic can be supported by the following major arc estimate: 

Proposition 8.3 (Strongly major arc estimate). We have 

„ S rn,^( x i a f S „ n ,/7Tk^ x I K ^ a)D No / 3 (a) 3 e(-xa) da 

II <^__b_ /U'v ' " 

11^/2^3.67^ (8.12) 



^(iV /3) 3 (^ + O*(0.1)). 



13 The ranges of the hi,hi,h-s parameters are not optimal; one could do a bit better, for instance, 
by requiring 1 ^ hi, ha ^ -No/20 and 1 < h 3 < 0.9iV instead, as this barely impacts Proposition 18.41 
but reduces the upper bound in (|8.19p by almost a factor of three. However, we will not need this 
improvement here. 
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Proof. From the mean value theorem one has 

N T 

D No/3 (a) = (JV /3)(1 + (9*(27r(AT /3)|H| R/z )) = (tf„/3)(l + 

when \\a\\ R/z ^ A. Since JV = 4x 10 14 , T = 3.29 x 10 9 , and O 8.7 x 10 36 , we 
conclude that 

D No/3 {a) = (N /2)(l + O*(10- 10 )) (8.13) 
(with plenty of room to spare). Also, by Proposition l7.2[ for a in the interval r — 



3.6nx ' 3.6irxi 

(and in fact for all a in the wider interval [— ip 1 , t^ 1 ]), one has 

S^i^/if, a) = ~fjo(ax/K) + 0*(a£^-^ + 2Mx 1 / 2 K- 1 / 2 N(T Q )\\r ]Q \\ LHR) ) 

where 170(a) := J R Vo(y) e i. ax y) dy and 

A 2 ■= 60||r7o|Ui( R ) + 32||^|| L i (R) + 4||<|| £ i (H) . 
From O, floTTT]) . f |5TT3|) one has 

A 2 = 252 + 256 log 2 ^ 330 
and with x ^ 8.7 x 10 36 , K = 10 3 , T = 3.29 x 10 9 , and N(T ) < 10 10 , we conclude that 

S Vo ^x/K,a) = ^-(f) (ax/K) + O*(10~ 6 )). 

By Lemma T4. II and (" 18 . 1 3[) (and bounding f)a(ax) as 0*(1) whenever necessary) we then 
have 

S ,o,^ {x/K ' a)DNo/3{a)3 = ^^(^W + X 10 " 6 ))- 



Let us consider the contribution of the error term ^ '(1.1 x lO" 6 ) to fH - By 
Corollary 14. 7[ we may bound this contribution in magnitude by 

1 1 X 1Q-6 *W 3 ) 3 2 ^ (xQ ) 

log x 

which by Lemma 14.31 and the bounds To = 3.29 x 10 9 , x ^ 8.7 x 10 36 can be safely 
bounded by 

io _ 3 * 2 (iVo/3) 3 
K 

Thus it will suffice to show that 

2 

S V1 ^(x, a) 2 f) (ax/K)e(-xa) da = x(- + C*(0.09)). (8.14) 

1 1 3ttx 

We apply Proposition 17.21 again to obtain 



log T 

■] 

where 171(a) := f R r)i(y)e(axy) dy and 



S vul (x, a) = xr)i(ax) + 0*{A 1 ^-^x + 2Mc~ 1/2 x 1/2 N(T )\\ Vl \\ L i {R) ) 

61 



A, := 60\\vi\\m R ) + 32c' 1 ||^|| il(R) + 4(c' 1 ) 2 ||^|| Ll(R) 
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with ci := 0.1, c[ = 0.9. From (JE3D, (EUD, M) one has 

A 1 = 229.2 

and with x ^ 8.7 x 10 36 , T = 3.29 x 10 9 , and N{T ) sC 10 10 , we conclude that 

S^O, a) = x(?7i(q;x) + e>*(HT 6 )). 

Bounding 170 as 0*(1), we may thus express the left-hand side of (I8.14p as the sum of 
the main term 

x 2 / fji(ax) 2 fj (ax / K)e(-xa) da 

1 d.OTTX 



and the two error terms 

O*(10- 6 x 



O*(10- 6 x \S Vul {x,a)\ da) (8.15) 



and 

O*(10 - V / \vi(ax)\ da). (8.16) 

We first estimate (|8.15p . From Corollary 14. 71 and Lemma [4.31 we have 

/ Tf) I^X,l(^«)l 2 < - lo g ( 2 2 T /3. 67r ) X 41 °S 2 X L043: 

■ / M<3^ 1 lol^ 

and so by Cauchy-Schwarz we may bound ( I8.15P by 



log X 

since O 8.7 x 10 36 , T = 3.29 x 10 9 , we may bound (ETTSj) by 0.02s. A similar (in 
fact, slightly better) bound obtains for (18.161) (using Plancherel's theorem in place of 
Corollary I4.7[) . We conclude that it will suffice to show that 

fjx{ax)%{ax/K)e(-xa) da = -(1 + O*(0.05)). 

x 

1 1 O.OTTX 

By Lemma 13.31 we have 



l^i (ax) I < 



2n\a\ 7r|a|' 

from this and the choice To = 3.29 x 10 9 one easily verifies that 

fjx(ax) 2 f) (ax/ K)e(— xa) da = O*(0.01— ) 

|q|> T X 

and so it remains to show that 

fj^axffioiax I 'K)e{-xa) da = -(1 + £>*(0.04)). 

x 



The left-hand side can be expressed as 

1 

x 




771(5)7/1(1 — s — t/K)rj Q {t) dsdt; 



R JR 



R 
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as r)o has an L 1 norm of 1, it thus suffices to show that 

77iO)t?i(1 - s - t/K) ds = l + O*(0.04) 

for all t = 0*(1). By (jHHJ and (JES]) one has 

Vl (l-s- t/K) = m(s) + O*(10/K) 
and so (by (18.41) and the /^-normalisation of r] ) 

[ 7 7l (s)7 7 i(l -s- t/K) ds = l + O*(10/K). 
Jr 

Since K = 10 3 , the claim follows. □ 



In view of the above proposition, it suffices to show that 



x 2 



^ \S Vl ^{x,a)Y\S mj ^{xlK,a)\\D N ^{a)\ 6 da ^ 0.56-(iV /3) 2 . 

(8-17) 



We have the following L 2 estimate: 
Proposition 8.4 (L 2 estimate). We have 

\S vli ^(x,a)\ 2 \D No/3 (a)\ 2 da ^ 7.09(iV /3) 2 x. 



Proof. By Proposition 14. 101 we have 
/ \S m ,^t(x,a)\ 2 \D No/3 (a)\ 2 da ^ 8H 2 x 1 + 

Jr/z \ 



0.13 logx (enoglog(2g) + ^gjgy) loggg ) 
H + 2H 



where H := |_ A^o / 3j . Since log a; ^ 3100 and iVo = 4x 10 14 , a brief computation then 
shows that 

/ \S m ^(x,a)\ 2 \D No/3 (a)\ 2 da ^ 8.001(iV /3) 2 x. 
Jr/z 

Meanwhile, from Corollary S3] (using the values x ^ 8.7 x 10 36 , T = 3.29 x 10 9 , c = 0.1, 
and the estimates (I8.3p . (18. 7p . (18. 9p to verify the hypotheses of that corollary), we have 



/ \S m ^ t (x,a)\ 2 da ^ 0.92x. 



and thus by (I8.13P 

/ T \S m ^(x,a)\ 2 \D No/3 (a)\ 2 da^0.919(N /3) 2 x. 

Subtracting, one obtains the bound. □ 

In view of this proposition and Holder's inequality, it suffices to show the L°° estimate 
that 

\S m ^^x/K,a)\\D No/ ,{a)\ ^ 0.078x(iV /3)-| (8.18) 
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whenever ||a|| R / z ^ ; comparing this with Lemma I4.3[ we see that we have to 
beat the "trivial" bounds in that lemma by a factor of roughly 13. By the conjugation 
symmetry (I4.5P we may assume that 



Tp < < 1 
3irx ^ ^ 2 ' 



We first consider the case of the weakly major arc regime 

< a < 



3.6nx Attx 
By the computations in the proof of Proposition 18.3} we have 

S *,y&mW K ' a) = ^o(ax/K) + Or {1.1 x 10~ 6 )) 
in this regime, so by the trivial bound \DN / 3 (a)\ ^ (No/3) it suffices to show that 

\fj (ax/K)\ 0.077. 



\r) {ax/K)\ ^ 



By Lemma 13 .3[ we have 

2ir\a\x/K 

Since a ^ we conclude from ( 15.10)) that 

\Vo(ax/K)\ < , 

-to 

and the claim follows (with plenty of room to spare) from the choices K = 10 3 , To = 
3.29 x 10 9 . 

Next, we observe from Lemma [4.31 and (15.111) (and the lower bound x/K ^ 8.7 x 10 33 ) 
that 

\ S ^^m^l K ^)\ < S VoA (x/K,0) < 1.01-|; 
combining this with the crude bound 



... 2 1 1 

1 — e(a) sm to 2a 



(by fl2HD), we have 

1 01 x 

I S m . 0/^ «) 1 1 ^0/3 («) I < 



which gives (18.181) whenever 

20 

a ^ — — 

and so we may assume that 

^<a<#. (8.19) 
In this regime we use the crude bound |Djy / 3 (a)| ^ N /3, and reduce to showing that 

\ S n<h ^jK t ( X / K ^)\^ - 078x - ( 8 ' 2 °) 
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We may write 4a = ^ + 0*(l/q 2 ) for some 
in particular 

Nn TTX 

80 ^ KTq 
We first consider the weakly minor arc case 

v2/3 - - nX 



{x/KY /A < q < 



KTq 

In this case we may use (11.121) to bound the left-hand side of (18.201) by 

(9.73 r7 i— \og 2 (x/K) + 1.2— pl= log -^(log + 2.4))^. 
(x/Kq) 2 \/x/Kq Kq Kq K 

Since x/q ^ Tq/tt, we can bound this by 



(9.73(^) 2 log 2 :r + 1.19^= log(T /7r)(log(T /7r) + 2.3))^. 

As logx ^ 3100 and T = 3.29 x 10 9 , the expression inside the parentheses is certainly 
less than 0.078 (in fact it is less than 0.004). Note here that the computations would 
not have worked if we had used the weaker bounds (11.91) . (II. lip in place of (11.121) . 

Next, we consider the intermediate minor arc case 

(x/K) 1/3 ^q^ (x/K) 2/3 . 
Here, we use (ll.9p to bound the left-hand side of (I8.20p by 

(0.14— + 0.64—^ + 0.15?T 1/5 ) logy (logy + 11.3) x 

Vv/q K 

where y := x/K. Using the bounds on q, this can be bounded in turn by 

(O.S^T 1 / 6 + 0.027?T 1/5 ) logy(logy + 11.3) x -|. 

Since y ^ 8.7 x 10 33 , the expression (OSy-^ + OAby- 1 / 5 ) log y(logy + 11.3) is bounded 
by 0.078 (in fact it is less than 0.013), so this case is also acceptable. 

Finally, we consider the strongly minor arc case 

^° _ 1 <; q <: (x/K) 1 ' 3 . 
80 H y 1 1 

Note that this case can be vacuous if x is too small. In this case we use (ll.lOp to bound 
the left-hand side of (I8T2DD by 

[0.5-log(2x/fT)(log(2x/fO + 15) + 0.31— logg(logg + 8.9)] x 
q y/q K 

Since \og(2x/K) ^ logs ^ 3100 and q ^ ^ — 1 = 5 x 10 12 — 1, the expression in 
brackets is bounded by 0.078 (in fact it is less than 0.0002). This concludes the proof 
of (18.181) in all cases, and hence of Theorem 11.41 
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